大量标记的医学图像对于准确检测异常是必不可少的,但是手动注释是劳动密集型且耗时的。自我监督学习(SSL)是一种培训方法,可以在没有手动注释的情况下学习特定于数据的功能。在医学图像异常检测中已采用了几种基于SSL的模型。这些SSL方法有效地学习了几个特定特定图像的表示形式,例如自然和工业产品图像。但是,由于需要医学专业知识,典型的基于SSL的模型在医疗图像异常检测中效率低下。我们提出了一个基于SSL的模型,该模型可实现基于解剖结构的无监督异常检测(UAD)。该模型采用解剖学意识粘贴(Anatpaste)增强工具。 Anatpaste采用基于阈值的肺部分割借口任务来在正常的胸部X光片上创建异常,用于模型预处理。这些异常类似于实际异常,并帮助模型识别它们。我们在三个OpenSource胸部X光片数据集上评估了我们的模型。我们的模型在曲线(AUC)下展示了92.1%,78.7%和81.9%的模型,在现有UAD模型中最高。这是第一个使用解剖信息作为借口任务的SSL模型。 Anatpaste可以应用于各种深度学习模型和下游任务。它可以通过修复适当的细分来用于其他方式。我们的代码可在以下网址公开获取:https://github.com/jun-sato/anatpaste。
translated by 谷歌翻译
This document presents endeavors to represent emotion in a computational cognitive architecture. The first part introduces research organizing with two axes of emotional affect: pleasantness and arousal. Following this basic of emotional components, the document discusses an aspect of emergent properties of emotion, showing interaction studies with human users. With these past author's studies, the document concludes that the advantage of the cognitive human-agent interaction approach is in representing human internal states and processes.
translated by 谷歌翻译
Target Propagation (TP) is a biologically more plausible algorithm than the error backpropagation (BP) to train deep networks, and improving practicality of TP is an open issue. TP methods require the feedforward and feedback networks to form layer-wise autoencoders for propagating the target values generated at the output layer. However, this causes certain drawbacks; e.g., careful hyperparameter tuning is required to synchronize the feedforward and feedback training, and frequent updates of the feedback path are usually required than that of the feedforward path. Learning of the feedforward and feedback networks is sufficient to make TP methods capable of training, but is having these layer-wise autoencoders a necessary condition for TP to work? We answer this question by presenting Fixed-Weight Difference Target Propagation (FW-DTP) that keeps the feedback weights constant during training. We confirmed that this simple method, which naturally resolves the abovementioned problems of TP, can still deliver informative target values to hidden layers for a given task; indeed, FW-DTP consistently achieves higher test performance than a baseline, the Difference Target Propagation (DTP), on four classification datasets. We also present a novel propagation architecture that explains the exact form of the feedback function of DTP to analyze FW-DTP.
translated by 谷歌翻译
Despite the impact of psychiatric disorders on clinical health, early-stage diagnosis remains a challenge. Machine learning studies have shown that classifiers tend to be overly narrow in the diagnosis prediction task. The overlap between conditions leads to high heterogeneity among participants that is not adequately captured by classification models. To address this issue, normative approaches have surged as an alternative method. By using a generative model to learn the distribution of healthy brain data patterns, we can identify the presence of pathologies as deviations or outliers from the distribution learned by the model. In particular, deep generative models showed great results as normative models to identify neurological lesions in the brain. However, unlike most neurological lesions, psychiatric disorders present subtle changes widespread in several brain regions, making these alterations challenging to identify. In this work, we evaluate the performance of transformer-based normative models to detect subtle brain changes expressed in adolescents and young adults. We trained our model on 3D MRI scans of neurotypical individuals (N=1,765). Then, we obtained the likelihood of neurotypical controls and psychiatric patients with early-stage schizophrenia from an independent dataset (N=93) from the Human Connectome Project. Using the predicted likelihood of the scans as a proxy for a normative score, we obtained an AUROC of 0.82 when assessing the difference between controls and individuals with early-stage schizophrenia. Our approach surpassed recent normative methods based on brain age and Gaussian Process, showing the promising use of deep generative models to help in individualised analyses.
translated by 谷歌翻译
In the field of reinforcement learning, because of the high cost and risk of policy training in the real world, policies are trained in a simulation environment and transferred to the corresponding real-world environment. However, the simulation environment does not perfectly mimic the real-world environment, lead to model misspecification. Multiple studies report significant deterioration of policy performance in a real-world environment. In this study, we focus on scenarios involving a simulation environment with uncertainty parameters and the set of their possible values, called the uncertainty parameter set. The aim is to optimize the worst-case performance on the uncertainty parameter set to guarantee the performance in the corresponding real-world environment. To obtain a policy for the optimization, we propose an off-policy actor-critic approach called the Max-Min Twin Delayed Deep Deterministic Policy Gradient algorithm (M2TD3), which solves a max-min optimization problem using a simultaneous gradient ascent descent approach. Experiments in multi-joint dynamics with contact (MuJoCo) environments show that the proposed method exhibited a worst-case performance superior to several baseline approaches.
translated by 谷歌翻译
使用三维(3D)图像传感器的智能监视一直在智能城市的背景下引起人们的注意。在智能监控中,实施了3D图像传感器获取的点云数据的对象检测,以检测移动物体(例如车辆和行人)以确保道路上的安全性。但是,由于光检测和范围(LIDAR)单元用作3D图像传感器或3D图像传感器的安装位置,因此点云数据的特征是多元化的。尽管迄今已研究了从点云数据进行对象检测的各种深度学习(DL)模型,但尚无研究考虑如何根据点云数据的功能使用多个DL模型。在这项工作中,我们提出了一个基于功能的模型选择框架,该框架通过使用多种DL方法并利用两种人工技术生成的伪不完整的训练数据来创建各种DL模型:采样和噪声添加。它根据在真实环境中获取的点云数据的功能,为对象检测任务选择最合适的DL模型。为了证明提出的框架的有效性,我们使用从KITTI数据集创建的基准数据集比较了多个DL模型的性能,并比较了通过真实室外实验获得的对象检测的示例结果。根据情况,DL模型之间的检测准确性高达32%,这证实了根据情况选择适当的DL模型的重要性。
translated by 谷歌翻译
针对目标的对话任务的先前研究缺乏关键观念,该观念在以目标为导向的人工智能代理的背景下进行了深入研究。在这项研究中,我们提出了目标引导的开放域对话计划(TGCP)任务的任务,以评估神经对话代理是否具有目标对话计划的能力。使用TGCP任务,我们研究了现有检索模型和最新强生成模型的对话计划能力。实验结果揭示了当前技术面临的挑战。
translated by 谷歌翻译
研究过程包括许多决定,例如如何应有资格以及在何处发表论文。在本文中,我们介绍了一个一般框架,以调查此类决策的影响。研究效果的主要困难是我们需要了解反事实结果,而实际上并非现实。我们框架的主要见解是灵感来自现有的反事实分析,其中研究人员将双胞胎视为反事实单位。提出的框架将一对彼此引用为双胞胎的论文。这些论文往往是平行的作品,在类似的主题和类似社区中。我们调查了采用不同决策的双论文,观察这些研究带来的研究影响的进展,并通过这些研究的影响来估算决策的影响。我们发布了我们的代码和数据,我们认为由于数据集缺乏反事实研究,因此这是非常有益的。
translated by 谷歌翻译
在本文中,我们报告了一项现场研究,在该研究中,我们在面包店使用了两个服务机器人作为促销活动。先前的研究探索了公共公共公众公共应用,例如购物中心。但是,需要更多的证据表明,服务机器人可以为真实商店的销售做出贡献。此外,在促销促销的背景下,客户和服务机器人的行为尚未得到很好的检查。因此,可以认为有效的机器人行为类型,并且客户对这些机器人的反应尚不清楚。为了解决这些问题,我们在面包店安装了两个远程操作的服务机器人将近2周,一个在入口处作为招待员,另一个在商店里推荐产品。结果表明,在应用机器人时,销售额急剧增加。此外,我们注释了机器人和客户行为的视频录制。我们发现,尽管放置在入口处的机器人成功吸引了路人的兴趣,但没有观察到访问商店的客户数量明显增加。但是,我们确认商店内部运行的机器人的建议确实产生了积极影响。我们详细讨论我们的发现,并为未来的研究和应用提供理论和实用建议。
translated by 谷歌翻译
一个由许多移动计算实体组成的自动移动机器人系统(称为机器人)吸引了研究人员的广泛关注,并阐明机器人的能力与问题的可溶性之间的关系是近几十年来的新兴问题。通常,只要没有任何机器人的数量,每个机器人都可以观察所有其他机器人。在本文中,我们提供了关于机器人观察的新观点。机器人不一定要观察所有其他机器人,而不管距离距离如何。我们称此新的计算模型瑕疵视图模型。在该模型下,在本文中,我们考虑了需要所有机器人在同一时刻收集的收集问题,并提出了两种算法来解决对抗性($ n $,$ n-2 $)中的收集问题 - 违法模型对于$ n \ geq 5 $(每个机器人最多观察$ n-2 $机器人在对手身上选择)和基于距离的(4,2)的模型(每个机器人在最接近的机器人最接近的机器人中分别观察到)分别,其中$ n $是机器人的数量。此外,我们提出了一个不可能的结果,表明在对抗性或基于距离(3,1)的模型中没有(确定性的)收集算法。此外,我们在放松的($ n $,$ n-2 $)中的聚会中表现出了不可能的结果。
translated by 谷歌翻译